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Abstract 

We use the relative modular operator to define a generalized relative 
entropy for any convex operator function g on (0, oo) satisfying g(l) = 0. 

We show that these convex operator functions can be partitioned into con¬ 
vex subsets each of which defines a unique symmetrized relative entropy, 
a unique family (parameterized by density matrices) of continuous mono¬ 
tone Riemannian metrics, a unique geodesic distance on the space of density 
matrices, and a unique monotone operator function satisfying certain sym¬ 
metry and normalization conditions. We describe these objects explicitly in 
several important special cases, including g{w) = — logtc which yields the 
familiar logarithmic relative entropy. The relative entropies, Riemannian 
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metrics, and geodesic distances obtained by our procedure all contract un¬ 
der completely positive, trace-preserving maps. We then define and study 
the maximal contraction associated with these quantities. 
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1 Introduction 


For quantum systems, a state is described by a density matrix P, i.e., a positive 
semi-definite operator with trace one. We will let T) denote the set of density 
matrices. For classical discrete or commutative systems we can identify the states 
with the subset of diagonal density matrices, each of which defines a probability 
vector p G R”. For commutative systems the usual logarithmic relative entropy 

Hiogip, q) = J2Pk ^og{pk/qk) ( 1 ) 

k 

can be generalized to 

Hg{p,q) = J2Pk9{qk/Pk) 

k 

where 5 ^ is a convex function on (0, cx)) with g{l) = 0. It is well-known that any 
such Hg contracts under stochastic mappings, i.e., Hg{Ap, Aq) < Hg{p,q) when 
A is a column stochastic matrix. Cohen, et al, defined the entropy contraction 
coefficient as 

Hg{Ap, Aq) 

P^q Hg{p,q) 

In the pair of papers [|^ |^ , it was shown that for each fixed A all the contraction 
coefficients associated with those g which are also operator convex are equivalent, 
more precisely 

Theorem 1.1 If g is operator convex, then 



(3) 


( 2 ) 


Vg{^) hlog(^) ^h|ui-l|(^)- 


(4) 


A summary of these results is given in [^. It suffices to mention here that the 
observation 


df tt ^ \ 

—Hg[p,p + tv) 


= /(O) Yl^'^kf/Pk = H^u,-ip{p,P + v) 


(5) 


i=0 


plays a critical role. The quantity Y.ki.'^kY/Pk can also be written as Mp{v,v) 
where 


Mp{u,v) = 




dadjd 


Hg{p + au,p + l3v) 


( 6 ) 


a=g=o 


is the Riemannian metric corresponding to the Fisher information. Cencov |^, ^ 
showed that, for commutative systems, this is the only Riemannian metric which 
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satisfies the monotonicity condition M^p^Av, Av) < Mp{v, v). Thus, we can regard 


Theorem as stating that for operator convex g the maximal contraction of the 
relative entropy and its associated Riemannian metric are the same. Since there 
is only one Riemannian metric, all the contraction coefficients must be equal. 
For quantum systems, the usual logarithmic relative entropy is given by 


ifiog(P,Q) = TrP(logP-logQ) 

roo r X 

= / TrP -- 

Jo Q tl 


(P-Q) 


1 


p + ti 


dt 


(7) 

( 8 ) 


with P, Q in P, the set of invertible density matrices. The integral representation 
IBI) can be used to show that 


Mp^{A,B) = 




dadjd 


Piog(P + aA,g + ;5P) 


a=/3=0 


TtA 


:B- 


1 


P + tl P + tI\ 


dt. 


(9) 


Although Mp^{A, B) is a monotone Riemannian metric, it is not the only pos¬ 
sibility; Mp{A,B) = TtA*P~^B is also monotone under completely positive, 
trace-preserving maps. The study of monotone Riemannian metrics on non- 
commutative probability spaces was initiated by Morozova and Cencov who 
did not, however, provide any explicit examples. A complete characterization of 
monotone Riemannian metrics (which includes the examples above) was given 
recently by Petz [|^, The quantum structure is much richer because left 

and right multiplications by P~^ are not equivalent. We will see that Mp{A,B) 
can always be written in the form TrA*Qp{B) where Qp reduces to multipli¬ 
cation by P~^ when P and B commute. Thus, for example, (i above gives 
f!p(B) = f, 


oo 1 
0 P+tl 


B 


p+ ti 


dt which becomes P ^B when P and B commute. 


Earlier, Ruskai tried to extend the entropy contraction coefficient results 
of Cohen, et al to non-commutative situations but obtained only a few preliminary 
results. Although one can formally dehne Hg{P, Q) = TtPqIQ/P) the expression 
Q/P is ambiguous in the quantum case. Using the non-standard dehnition Q/P = 
P~^PQP~^P , [which yields Hg{P,Q) = TTPlogP~^PQP~^P rather than 
when g{w) = — logtn.] Ruskai and Petz [Q were able to prove an analogue of 
Theorem llT] using the fact that 


dP 


Hg{P,P + tA) 


= /(O)TrAP-M 


( 10 ) 


t=0 


for all g. In essence, their convention for Q/P always yields the Riemannian 
metric Mp{A,B) = TrA*P“^P. 
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A better alternative is to use the relative modular operator introduced by 
Araki P, ||, ^ ^ ^ to define Q/P. This yields the usual logarithmic entropy 

(|^ and a rich family of generalized relative entropies. Moreover, differentiation 
then yields the entire family of monotone Riemannian metrics found by Petz 

,^, 0 - 

In this paper we use the relative modular operator to study both the relative 
entropies and Riemannian metrics associated with convex operator functions. 
For simplicity, we restrict ourselves to the matrix algebras associated with finite 
dimension systems. Although we do not believe this restriction is essential, it 
avoids many technical complications. [The most serious arises when the condi¬ 
tion TrP = 1 is not compatible with the requirement that P be invertible (in the 
sense of having a bounded inverse in relevant operator algebra). In that case, one 
must restrict the domain of Hg{P, Q) to those pairs P, Q which have comparable 
approximate null spaces in some suitable sense.] We show that each convex oper¬ 
ator function defines a convex family of relative entropies, a unique symmetrized 
relative entropy, a unique family (parameterized by density matrices) of contin¬ 
uous monotone Riemannian metrics, a unique geodesic distance on the space of 
density matrices, and a unique monotone operator function. We describe these 
objects explicitly in several important special cases, including g{w) = — logtc. 
We then define and study the contraction coefficient associated with the relative 
entropy, Riemannian metrics, and metrics. Finally, we examples showing that 
these contraction coefficients can have any value in [0,1] for a suitable stochastic 
map. 

The paper is organized as follows. In section 2, we give some basic definitions 
and results for relative entropy and Riemannian metrics. In section 3, we define 
the corresponding the geodesic distance, including the Bures metric as a special 
case. Finally, in section 4 we study the contraction of all the quantities under 
stochastic maps and give bounds on the maximal contraction. 




2 Relative Entropy and Riemannian Metrics 

2.1 Definitions 


We begin by describing the relative modular operator which was originally in¬ 
troduced by Araki to generalize the logarithmic relative entropy to type III von 
Neumann algebras 0 S 1,0,0, g. Later, Petz used it to generalize 
relative entropy itself. Let T> denote the subset of invertible operators in P. Let 
P,Q E P i.e., P and Q are positive definite matrices with Tr(P) = Tr((5) = 1. 
For matrix algebras, the relative modular operator associated with the pair of 
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states pp{A) = Ti{AP) and Pq{A) = Tr(y4Q) reduces to 

^Q,p = LqRp^, ( 11 ) 

where Lq and Rp are the left and right multiplication operators, respectively. 

Thus Aq^p{A) = QAP~^. It is easy to verify directly that Ag p is a positive 

Hermitian operator with respect to the Hilbert-Schmidt inner product. 

Definition 2.1 Let g be an operator convex function defined on (0, cxo) such that 
g{l) = 0. The relative g-entropy of P and Q is 

H,iP, Q) = Tr(pV2^(AQ,p)pV2). (12) 

We will let Q denote the set of functions satisfying these conditions. Note, how¬ 
ever, that the argument of g, as dehned here, is shifted from that (which we here 
denote gc) in 0] and 0 so that gc{w) = g{w + 1). Using standard results from 
the theory of monotone and convex operator functions, one can show that Q is 
the class of functions which can be written in the form form 

g{w) = a{w — 1) + b{w — 1)'^ + c- -h / - di>{s), (13) 

w Jo w + s 

where 6 , c > 0 and z/ is a positive measure on ( 0 , cx)) with hnite mass /q“ dv{s). 
The term may seem unfamiliar, as it is usually included implicitly in the 

integral. However, writing it separately will be convenient later and is necessary 
to ensure that the measure has hnite mass. The function g{w) = -~\ogw yields 
the usual logarithmic relative entropy (|^) which we continue to denote Piog(P, Q). 
The function g{w) = (tc — 1)^ yields 

= Tr(P - Q)P-\P - Q) (14) 

which we call the “quadratic relative entropy”; it plays an extremely important 
role in our development. The function g(pv) = {w — 1)^/ {w + 1) yields the equally 
important, but less familiar PBures(-P,Q) = At{P — Q)[Lq + Rp]~^{P — Q), where 
we use the subscript Bures because (as will be explained in section ^ it eventually 
leads to a geodesic on T> referred to as the “metric of Bures”. 

We will study the properties of relative entropy and related quantities under 
a class of maps referred to as “stochastic”. 

Definition 2.2 A stochastic map (f ■. Ai ^ A 2 is a completely positive, trace¬ 
preserving map from one von Neumann algebra to another. 
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For commutative systems, a stochastic map always corresponds to a column 
stochastic matrix, as discussed in the Introduction and in i i H- For non¬ 
section 


see 


or 


e.g., ig, 0) 


IS an 


commutative systems, a partial trace 
example of a stochastic map. General conditions can be obtained from the Stine- 
spring representation for completely positive maps or the subsequent work of 
Choi PI and Kraus |0| who showed that 0 is a completely positive if and only if 
there exist operators {14} with Vk '■ A\ ^ A 2 such that 


N 

<t>{A) = Z VtAV:. (15) 

k=l 

The condition that 0 is trace preserving is then J2k ^k^k = I [and not J2k VkVk = 
/, which is the condition that 0 is unital, i.e., 0(/i) = I 2 ] For algebras with trace 
(as is the case here) one can use the Hilbert-Schmidt inner product {A,B) = 
TtA*B to dehne the adjoint 0 of any completely positive map so that Tr4*0(i?) = 
Tr0(y4)*i?. It is then easy to see that 0(4) = J^k^kAVk and that 0 is trace 
-preserving if and only if 0 is unital. 


2.2 Relative Entropy 

We begin by dehning a relative entropy distance as a bilinear function on V 
with the properties we expect of the relative g-entropy Hg{P, Q). It is sometimes 
convenient to extend our dehnition from T> x T> to the somewhat larger set of 
pairs P, Q of positive dehnite matrices with TrP = TtQ. 

Definition 2.3 By a relative entropy distance we mean a function H{P, Q) sat¬ 
isfying: 

a) H{P, Q) > 0 with H{P, Q) = 0^ P = Q. 

b) H{XP, XQ) = XH{P, Q) for X>0. 

c) H{P, Q) is jointly convex in P and Q. 

In addition, we say that the relative entropy is monotone if 

d) H{P,Q) decreases under stochastic maps 0, 
that it is symmetric if 


e) H{P,Q) = H{Q,P) 
and that it is differentiable if 
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f) the function g{x, y) = H{P + xA, Q + yB) is differentiable. 


Conditions (b), (c), and (d) are not independent. It is well-known that by em¬ 
bedding in ® and choosing cj) to correspond to the partial trace 

over C^, one can show that (d) implies the subadditivity relation 

H{P, + F 2 , Qi + Q2) < H{P,, Q,) + H{P 2 , Q2). (16) 


But for functions satisfying the homogeneity condition (b) this is equivalent to 
joint convexity. Because any stochastic map can represented as a partial trace 
T^ , it follows that when (a) and (b) hold, then (c) (d). Nevertheless, the 


properties of convexity and monotonicity are each of sufficient importance to 
justify explicitly stating them separately. 

A relative entropy distance (even if symmetric) is not a metric in the usual 
sense, because it need not satisfy the triangle inequality. Nevertheless, such quan¬ 
tities have been widely used |^, ^ to measure the difference between P and 

Q. Later, we shall show that every relative g-entropy defines a relative entropy 
distance which then defines a Riemannian metric and an associated geodesic dis¬ 
tance. 


Theorem 2.4 Every relative g-entropy of the form given in Definition is a 
differentiable monotone relative entropy distance in the sense of Definition |il.5|. 


Proof: Properties (a), (b) and (f) are straightforward; (d) 
implies (c) by the above remarks. A simple new proof of (d) 

m 


is due to 

is given in Section 


Theorem 2.5 For each operator convex function g E Q, 

HfiP.Q) = i:i{Q-P)\hgP-^ + CgQ-^]{Q-P) 

-(Q - P)^ dvg{s) 


^“Tr((Q-P)- 


Lq -\- sRp 

= TT[{Q-P)R-p^g{AQp){Q-P) 
where bg, Cg and Vg are as in (M)- 
Proof: We hrst observe that 

(Ao,P - = (Q - = Rp-mi.Q - P). 

so that 


//„,_i(P,Q) = Tl' [P'P-(Q - P)P-‘'j = 0, 


(17) 


(18) 


(19) 


( 20 ) 
















and the linear term in (p!^ does not contribute. We also find using ([T^) again 

H,(P,Q) = {(Aa,P-/)(F‘''=),(Ae.p + s/)-‘(Ag,p-/)(F‘/ 2 )) 

\Q - P)(Aq,p + s/)-'flp-i(0 - P) 


= Tr 


= Tr(Q-F) 


Lq + sRp 


(Q - n 


( 21 ) 


Letting s = 0 yields 

Q) = Tr [{Q - P)Q-\Q - P)] = P) 

and one easily verifies that 

F(^_i) 2 (P, Q) = Tr((g - P)P-\Q - P)). 

Using these results in ([I3| ) gives the desired result (|T7|). 

It is worth pointing out that the cyclicity of the trace implies that 

Tr(Q - P)—^-—(Q -P) = Tr(g - P)_A_(g - P), 


Rp ~\~ sL 


Q 


Lp -\- sRi 


Q 


although 


ri'W - P) „ h, W -P)¥^ Tr(Q - P) „ h, (Q - P), 


Rp sL, 


Q 


Rq + sLp 


( 22 ) 


(23) 


(24) 


in general. 

One can also use the heat kernel representation 

roo 

(Aq,p + sl)-^ = / (25) 

to obtain another integral representation of Hg{P, Q). 

Theorem 2.6 Let mg{u) = e~'^^dv{s) denote the Laplace transform of the 

measure Vg. Then 

Hg{P, Q) = hgH(y,_l)2 (P, g) + CgH(^^_l)2 (g, P) + 

noo 

+ / H {P,Q)mg{u)du. 

0 

where we formally extend our definition of Hg{P,Q) to the non-convex function 
g{w) = {w — 
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Proof: We use 


in 


( 0 . 


r((Ag,p - (Aftp + i!/)-‘(Ag,p - I)(P'P))dy,{s) 

'J 0 

roo 

= / {(Ag,p-/)(P‘/=),e-“^0'UAg,p-/)(P‘'=)K(u)* 

poo 

= / TiiiQ - P)iRp-ie-^^'^’n{Q - P))mg{u)du 

poo 


/ H^^_i)2^-m.{P,Q)mg{u)du 

0 


where we have interchanged the order of integration and then used (J^) again. 


2.3 Monotone Riemannian metrics 

We now consider the relation between relative g-entropy and Riemannian metrics. 
Note that the set of density matrices V has a natural structure as a smooth 
manifold, so that we can dehne a Riemannian metric on its tangent bundle 
whose hbers consist of traceless, self-adjoint matrices or 

TpV = {A = A*: TrR = 0}. (26) 


Definition 2.7 By a Riemannian metric on V, we mean a positive definite bi¬ 
linear form Mp{A,B) on TpP such that the map P —> Mp{A,A) is smooth for 
each fixed A G T^,V. The metric is monotone if it contracts under stochastic maps 
in the sense 


M^(P)[0(R),0(R)] <Mp(Ai?) (27) 

when (j) is a stochastic map. 

Note that this dehnition of monotone requires that the stochastic map act on 
the base point (i.e., the indexing density matrix P) as well as the arguments of 
the bilinear form. 


Theorem 2.8 For each g ^ G and density matrix P ^ V, 


MUA,B) 


{A,a‘p{B)) = TrA’piB) 


(28) 

(29) 


defines a Riemannian metric on TpV, and a positive linear operator Qp on TpV. 
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The theorem follows easily from the fact that Rp, Lp and their inverses are 
positive semi-definite operators with respect to the Hilbert-Schmidt inner prod- 
net, e.g., TtA*RpA > 0, and the integral representation in Theorem p.5| . We 
hnd 


{A,n%{B)) = {hg + Cg)i:i[AL-p\B) + BLp\A)] + 

poo J- -| 

+ / Tr [A{Lp + sRp)-\B) + B{Lp + s/?p)-'(A)J dug{s) 

J 0 

= {hg + Cg)TiA[L-p^ + Rp\B) + 

{Lp sRp) ^ -|- {Rp -|- sLp) ^ {B^dvg{s) 


POO 

+ / TyA 
Jo 

POO 


TtA 
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= A 


{Lp + sRp)-^ + {Rp + sLp)-^\ {B)Ng{s)ds 
{Lp + sRp)~^ + {Rp + sLp)~^ {B) Ng{s)ds) (30) 


where, for simplicity, we temporarily snbsnme the qnadratic terms into the inte¬ 
gral by dehning Ng so that Ng{s)ds = {bg + Cg)6{s)ds + dh'g{s). It is critical that 
A and B are self-adjoint so that we can interchange A and B by replacing Lp by 
Rp as in 


TrBLp\A) = TtBR-^A = TiABP-^ = TiARp^B). (31) 


This resnlt would not hold if we did not require the perturbations of P and Q 
to be self-adjoint. Given that requirement, the result is necessarily symmetric in 
the sense that we get the same result from both L[g{P, Q) and L[g{Q, P). This is 
already evident in the quadratic term, whose coefficient depends only on the sum 
b + c, and will be discussed further below. 

We can now use (^) to obtain several explicit formulas for Qp. 


QPp 


+ 


>0 \sRp “1“ Lp sLp Rp 

poo ^ 


Ng{s)ds 


lo sRp Lp 

f'OO ^ 


R 


-1 


Jo s -|- Ap^p 
1 / 1 


0 \sRp Lp sLp -\- Rp 


(iVg(s) + s ^Ng{s ^)) ds 

— ag{s)ds 

1 \ 

agisjds, 


(32) 

(33) 

(34) 

(35) 


where we have used the change of variable s —+ s ^ and 


ag{s) = Ng{s) + s ^Ng{s ^). 
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Note that ag{s = sag{s). Then, if we dehne 


we hnd that k{X 
terms of g as 


k{X) = j — —ag{s)ds 

'J 0 


0 s + 

1 r 1 1 

+ 


( 36 ) 


/o 


.s "h A sA T 1_ 


ag{s)ds, 


= Xk{X), Qp = Rp^k(App), and that k can be expressed in 


k(w) = 


g(w) +wg(w 


(w — 1)2 

We will let /C denote this set of fnnctions, i.e., 

/C = {fc : —k is operator monotone, k{w~^) = wk{w), and k{l) = 1 }. 


(37) 


(38) 


We have recovered half of Petz’s resnlt [El, ESl ETI that there is a one-to-one 


correspondence between symmetric Riemannian metrics and fnnctions of the form 
(p 6 D which satisfy the normalization condition /c(l) = 1. (But note that our k 
corresponds to 1// in Petz’s notation.) Our approach also easily yields an explicit 


expression for both Qp and its inverse. 


Theorem 2.9 For each g G Q and P E T>, the operator Vt^p as defined in Theorem 
\2.di satisifes Xlf, = Rff k{LpR'ff) and [Op]“^ = Rpf{LpRf}) where k{w) is given 
by and f{w) = l/k{w). 

Although Qp is initially dehned only on T*P, it can easily be extended to all 
traceless matrices using the natural complexihcation TrA = 0 A = Ai -|- iA 2 
with Ai, A 2 G TpV and then to all of using linearity and Op(/) = P~^I. The 
result is equivalent to using any of the formulas for Op above together with the 
obvious extension of Lp and Rp to all of We can summarize this discussion 

as follows. 


Theorem 2.10 For each g E G and P eT), the operator Op as defined in Theo- 
rem 12.^ can he extended to a positive linear operator on so that Mp{A, B) = 
TrA*Op(i?) defines an inner product on On the other hand, for each g E G 

and P eV equation defines a positive linear operator Qf. on all and 

the bilinear form Mp{A,B) = TrA*Op(i?) extends to a monotone Riemannian 
metric satisfying the symmetry condition Mp{A,B) = Mp{B*,A*). 

This result is essentially due to Petz 0 , 111 , 0 , who also showed the converse 
result that every symmetric monotone Riemannian metric is of this form. We 
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give an independent proof of monotonicity at the end of this section. That the 
metric is symmetric is a conseqnence of the cyclicity of the trace. 

The following result is essentially due to Kubo and Ando |]^ who developed 
a theory of operator means. 

Theorem 2.11 If k given by ([5^ satisfies k(l) = 1, then for all P,Q eV 


Rp + Lq > Rp k[A.Q^p) > {Rp + Lq) 


-1 


(39) 


Proof: This follows easily from (PB|), the elementary inequality 


w + 1 ^ l + t 
2w ~ 2 


+ 


t + w tw + 1 


> 


tc + 1’ 


(40) 


and the fact that the normalization k{l) = 1 implies that 2ag{t)/{t + 1 ) is a 
probability measure on [ 0 , 1 ]. 

As immediate corollaries, we hnd 

= Lff + Rf} >9fip> {Rp + Lp)-^ = (41) 

A) > Ml,{A, A) > A) (42) 

^5“ 1)2 (P, Q) > Hf^{P, Q) > Hp^UP. Q) (43) 

where the superscript indicates the symmetric relative entropy associated with g. 
Thus k{w) = 2/{w + 1) corresponds to the minimum symmetric relative entropy 
and minimum Riemannian metric among the class studied here. By contrast, we 
will see that g{w) = {w — 1 )^ corresponds to k{w) = {w + l)/{2w) so that the 
quadratic relative entropy is maximal. 

The operators Qf, and are non-commutative versions of multiplication 

by P~^ and P respectively. Hence, in view of the cyclicity of the trace, the 
following result is not surprising. 

Theorem 2.12 The operator QPp given by satisfies Tr QPp{A) = TrAP“^ 
andAi[Vt%]-^{A) = TrAP 


Proof: We first observe that in a basis in which P is diagonal with eigenvalues 
Pk 


[Rp 


-1 


s A 


p,p 


-{A)]jk = 


sRp P Lp 


{A)]jk = 


SPk + Pj 


^jk 


(44) 


so that 

[nfi{A)]jk = [ — ^ — ag{s)ds. (45) 

Jo spk + Pj 


13 
















Then for every g E Q,P EV, and A G TpV 


TiSlUA) = Z 


a 


'3J 


= Et 


^ .0 SPj+Pj 

poo 1 
-1 / 


ag{s)ds 


'j 


j 


Jo s + 1 


ag{s)ds 


= fc(l)TrP-M = TrP-M. 


The proof for the inverse is similar. Since 1/k is also operator monotone, we can 
use Theorem to conclude that can be written in the form 

POO 

= aRp + bLp- f dp{s). 

Jo sRp J-/p 

for some positive measure p. 


2.4 Correspondence between defining functions 

We now make some remarks on the relation between g{w),wg{w~^), and k{w). It 
should be clear from the development above that every function g E Q dehnes a 
Riemannian metric and a function k as in (^) or (|57|). If we now consider g{w) = 
wg{w~^), it is easy to verify that g{w) E Q as well and that H^{P, Q) = Hg{Q, P). 
Thus, the map g{w) —> wg{w~^) has the effect of switching the arguments of the 
relative entropy and the function g{w) +wg{w~^) yields the symmetrized relative 
entropy Hg{P,Q) + Hg{Q,P). Now, if we begin with a function g and relative 
entropy Hg{P,Q, the differentiation in (pH]) automatically yields a symmetric 
result. Thus, all convex combinations ag{w) + (1 — a)g{w) of g and g{w) yield 
the same Riemannian metric and the same function k E 1C. 

Conversely, every k E 1C dehnes a unique symmetric relative entropy via the 
function g^^'^{w) = {w — l)‘^k{w). It follows immediately from the integral repre¬ 
sentation (^) and ([T^) that g^^'^{w) is also in Q and that wg^^'^{w~^) = g^^'^{w). 
Thus, k selects from the convex set of relative entropies associated with a given 
g E Q, the symmetric one. If we observe that the integral representation (p6D is 
equivalent to —k being an operator monotone function, we can summarize the 
discussion above as follows. 

Theorem 2.13 There is a one-to-one correspondence between each of the follow¬ 
ing 

a) monotone Riemannian metrics extended to bilinear forms via the symmetry 
condition Mp{A, B) = Mp{B*, A*), 
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b) monotone (decreasing) operator functions satisfying k{w = wk{w) with 
the normalization /c(l) = 1 , and 

c) convex operator functions in Q which satisfy the symmetry relation wg{w~^) = 

9{w). 


The relations between these are given by (|3^), (|36|), and (|37|). In view of 
this theorem, it would be appropriate to identify a given operator QPp by us¬ 
ing the (unique) symmetric function However, we will continue to use the 

asymmetric g for such familiar cases as the logarithm. One might expect the 
one-to-one correspondence to extend to twice-differentiable symmetric monotone 
relative entropies. However, Petz and Ruskai consider relative entropies of the 
form Hg{P,Q) = TTPg(P~^^^QP~^^‘^). This class of monotone relative entropies 
can be symmetrized; however, differentiation of Hg yields the Riemannian metric 
B) = PiA*[p-^B + BP-^] for all g eg. Thus, in particular Q) 

= TrPlog(P~^/^QP“^/^) is an example of a relative entropy distance which is not 
a relative g-entropy in the sense of Definition |2]^. Another class of distinct rela¬ 
tive entropy distances is given by squares of the geodesic distances introduced in 
Section 3. Thus, the properties in Definition 1.3 are not sufficient to completely 
characterize the relative g-entropy and allow us to extend the one-to-one corre¬ 
spondence in Theorem 2.4 extend to a class of relative entropies. Although we 
believe that such an additional condition must exist, we have not found it. 


2.5 Examples 

We now give explicit expressions for the relative entropy, Qp and related quantities 
in several important special cases. Thiese examples will also illustrate the relation 
between the functions g, g, and k discussed above. 

Example 1: Take g{w) = — logw. Then g{w) = w\ogw, 5 ^®^™ = {w — l)logr(;, 
k{w) = (tc — 1)“^ logtc, Ng{s) = (s-t-l)“^ and o'g{s) = l/(s-l-l). Then, Wiog(P, Q) 
is given by (0 ), and 

= W(^_i)iog^ = Tr(P-g)[logP-logQ] 

and 

^log _ ^ _I_ ^ ^ ^ 

^ Jo \sRp + Lp sLp + Rp) {s + iy 


Jo s ~h 1 Lp sRp 


(46) 

(47) 
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Making the change of variables s —^ sRp in the last integral, yields 

^log ^ 1 - 1 

Jo S “h J^p S ~\~ itp 


( 48 ) 


SO that 


{ki,f]p®(s)) = Tr Ha* 
Jo 


-B- 


sl + P si + P 


ds, 


(49) 


a result that we obtained earlier using the integral representation (|^) or 


log P - log Q 


roo 

1 

1 

Jo 

Q + xl 

P + xl 


dx. 


(50) 


In this case, it is also well-known JT^ that the inverse operator can be written 
as 


r PBP^-^dt. 

Jo 


(51) 


Example 2: Take g{w) = {w — 1)^. Then g{w) = {w — = 

{w — l)‘^{w + l)/w and k{w) = (w-|-l)/(2ic). Then P(^_i) 2 (P, Q) is given by (P^), 


= Tr(Q-P) 
VL 


p-^ + Q-^ (Q-P), 




= Rl} + Ll} 


and 

= P(^_i)2(P,P + kl) = Trkip-^A 

The associated function is the maximal function satisfying the prescribed condi¬ 
tions. The operator (P) = P~^B + BP~^ so that 

+ Lp^ = R-^[Rp + Lp]Lp\ (52) 


Example 3: For sq > 0 take gso{w) = {w — l)^/(tc + sq). Then gso{w) = 
{w - 1)^/(1 -l-wso), 5'®y“(tc) = {w- lY{w -k 1)(1 -f- so)/(l -ktcso)(tc -t- sq), k{w) = 
{w + 1)(1 + So)/(l + wso){w + So), and Ng{s) = 6{s - Sq). Thus 


np 


1^1 
SgPp -|- Tp SqLp P Rp 
(■So + l)[so-Rp +-hp] ^[Rp + Lp][Rp + sqLp] 


(53) 
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When no confusion will result, it will be convenient to employ a slight abuse 
of notation and write ffp for ilp°. The case sq = 1 is particularly important; 
we have already seen that it yields the minimal k E fC. Then k{w) = 

g{w) = = [Rp + Tp]“\ The corresponding 

Riemannian metric is (A, r2p™®®(i?)) = Try4*[i?p + Lp]“^(i?) and the correspond¬ 
ing relative entropy 

HpuresiP, Q) = Tr(Q - P)[Rp + Lq]-\Q -P) = TtQXPX (54) 

where X = [Rp + Lq]~^{Q — P). Because of the cyclicity of the trace, insures(-P, Q) 
is already symmetric and [Rq -|- Lp]“^ would have given the same result. 

Example 4: Take g{w) = 1 - w^. Then k{w) = and Ng{s) = 

^(l + s)“-2. Thus 

Hi_^c{P,Q) = l-TrQ“pi-“. 


Qi = R7^ 


sm ITS 


^ Jo si -\- Xp p TT 
After the change of variables s sRp this becomes 


(1 -h s)" ds. 


Pp“" + 


sinvrs 1 

TT JO Lp ~\~ s i^Rp ~\~ 


ds. 


(55) 


2.6 Monotonicity proof 

We now present a new proof of the monotonicity of the relative entropies and 
Riemmanian metrics associated with convex operator functions. 

Theorem 2.14 For every convex operator function g of the type considered here, 
both the relative entropy Hg{P,Q) and the corresponding Riemannian metric are 
monotone, i.e. 


H,(P,Q)<H,\4,(P),4,(Q)], (56) 

{Aa%A) < (57) 

This result is essentially due to Petz |^. We give an independent proof as an 
immediate corollary of the following theorem and the integral representations (0 
and ([5^). 
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Theorem 2.15 If (f) is stochastic 


TtA* 


Rp sL 


-A = Tr(/)(A* 


-A 


Q 


\ Rp A 
> Tr4>(A-)- -- 4,(A). 


(58) 


Proof: If P > 0, then Hi A* PA > 0 and Tr 74 * 74 P > 0 so that both Lp and Rp are 
positive as operators on the Hilbert-Schmidt space. Thus for Q > 0, the operator 
Rp + sLq is also positive. Let X = \Rp + sLq\~^/‘^{A) — \Rp + sLqY/‘^6{B) with 
B = iC) + Theu ^X-X > 0 so that 


TrA'- - - A - TiA'MB) - TiMB')A 

Rp + sLq 

+Hi^{B*)[Rp + sLq]^{B) > 0. 


(59) 


Since it is easy to see that 


-HiA*^{B) - Hi^{B*)A = -2Hi<i){A*)- --0(/l), 

-R</.(p) + 


the desired result will follow if we can show that the last term in (M) is bounded 
above by the right side of (|58|). We find 


Tr0(P*)[Pp + 5LQ]0(P) = 


< 


Tr0(P*)0(P)P + ${B*)sQ${B) 
Tt${B*)${B)P + ${B)${B*)sQ 
Ht${B*B)P + ${BB*)sQ 


where the inequality follows from positivity of P and Q and the operator inequal¬ 
ity 


${B*)${B) < ${B*B), (60) 

which holds for any B because the trace-preserving condition on 0 gives (piR) = 
R. Then using, e.g., Tr0(P*P)P = TrP*P0(P), we find 


Tr${B*)[Rp + sLq]${B) < 


TrP*P0(P) + BB*s(j){Q) 
TrP*[P0(P)+s0(Q)P] 

TrP*[P<ji(p) sL^(q)]B = HiB*( j){A) 

T^4>(A‘)- - P— - 4>(A). 

ti^{P) A sL^(^q) 
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It is interesting to observe that the strategy used here is very similar to that 
used by Lieb and Ruskai to prove a Schwarz inequality for completely posi¬ 
tive mappings and, as a special case, the monotonicity of the quadratic relative 
entropy. At that time, Lieb and Ruskai could use these Schwarz inequalities to 
prove many special cases of the strong subadditivity of the logarithmic relative 
entropy, but not the general case. A complete proof of strong subadditivity 


see also Q) seemed to require one of the convex trace function theorems of 
0- It is therefore curious that now, some 25 years later, we have hnally 


Lieb 


found a way to recover strong subadditivity directly from the Schwarz strategy 
of Lieb and Ruskai 


18 


It should also be noted that Uhlmann had earlier used a very different 
approach (based on interpolation theory) to show the logarithmic relative entropy 
was monotone under a related class of mappings that are Schwarz in the sense 
0(A*A) > 0(A*)0(A) and Petz 


23| extended this to other relative entropies. 


3 Geodesic distance 

We now wish to consider the contraction of the relative entropy and corresponding 
Riemannian metric under stochastic mappings. Before doing so, it will be use¬ 
ful to consider the geodesic distance which arises from the Riemannian metrics 
considered here. 


Definition 3.1 Associated with every Riemannian metric {A, Qp{B)) of the form 
(0) is a geodesic distance Dg{P,Q) which is defined as 


D,(P,Q) = mf/ J(S((),SJ»(„S(()>* 


where the infimum is taken over all smooth paths S{t) with 5(0) = P and 5(1) = 

Q. 


Theorem 3.2 The square [Dg{P,Q)]‘^ of every geodesic distance of the form 
given in Definition \3.1\ is a differentiable monotone relative entropy distance in 
the sense of Definition fS. 5| . In addition, Dg{P,Q) satisfies the triangle inequality 
Dg{P,R) < Dg{P,Q) + Dg{Q,R). 

Proof: Properties (a), (b) and (e) of Dehnition [1.3| are readily verified. Property 
(d), i.e, the monotonicity Zlg[0(P), 0(Q)[< Dg{P,Q) can be proven directly, but 
also follows easily as a corollary to Theorem below. The triangle inequality 
is standard. That Dg{P + xA, Q + yB) is differentiable in the sense of Dehnition 
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p^(f) follows from standard results (see, e.g., Theorem 3.6, part (2) of ||T3[|). 

QED 


It is well-known 


see, e.g., 
2 


p3| , |35[| ) that the metric associated with the min- 

, is (e 

the metric of Bures, i.e., D2(w-i)‘^/{i+w){P, Q) = Q) where 


imal function k{w) = discussed in Example 3, is (except for normalization) 


[^Bures(p,g)]2 _ {Tr(iy - X){W - X)* : WW* = P, XX* 

= 2[l-^r{^/PQ^/pyP 

< Tr[v/P-^]2 = 2[l-Trv^^]=Pi_^(P,Q). 


It follows immediately from f[fll) that 


Q] 

(61) 

(62) 


Q) > D,{P, Q) > 4P^”-(P, Q). (63) 


so that 4P^“''®®(P, Q) gives the minimal geodesic distance of this type. 


4 Contraction Under Stochastic Maps 

4.1 Contraction coefficients 

Because the relative entropies, Riemannian metrics, and geodesic distances all 
contract under stochastic maps, their maximal contraction is a well-defined quan¬ 
tity in the following sense. 


Definition 4.1 For each fixed convex operator function g of the form given in 
Def. \2.1\ and stochastic map f we define three entropy contraction coefficients 


RelEnt 

Ig 


Riem 

Ig 


( 0 ) 

( 0 ) 


7 r"( 0 ) 


sup 

P^Q 


P,[0(P),0(Q)] 

HAP,Q] ’ 

(0(y4),fl®(p)[(/)(y4)]) 


sup sup 

P A&TpV 


sup 

P^Q 


{A,ai,iA]) 

[DMP)A(.Q)W 

ID,{P,QW ' 


(64) 

(65) 

( 66 ) 


In Q it was shown that for commutative systems, = r7^^®™(0) = 

'r](^w-i) 2 {(j)). Here, we will prove some relations between these various r]. 

Theorem 4.2 The three contraction coefficients defined above satisfy 

1 > > Ilf “(^) > Iir‘‘('#-)' (67) 
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The intuition behind the second inequality can be seen by letting A = B = 
Q — P in the integral representations of Theorems 2.2 and 2.3. Then the only 
difference between the ratios in (|6^ and (^) is that the modular operator in the 
former is Ag^p while that in the latter is Ap_p. This would seem to indicate that 
the first supremum is taken over a larger set. However, the two are not directly 
comparable because the condition P ^ Q in the hrst case precludes the choice 
App. Hence, we consider Q = P + eA. 

Proof: The upper bound of 1 follows immediately from Theorem |2.14 To prove 
the second inequality consider, as suggested above, 

Hg{P,P + eA) = Proceeding as in the proof of The¬ 

orem ^]^but with the shorthand dNg[s) = {bg + Cg)6{s)ds + dug^s), we obtain 


Hg{P,P + eA) = e^ 


Tr 


A- 


poo 


e" / Tr 
Jo 

2 


A 


Lp+eA + sRp 

^ (^) 


(^) 


dNg{s) 


Lp -\- sRp 

^ 0 (p)( 0 (^))) + 0{e^). 


dNgi^s) -|- 0{e^) 


Thus 


RelEnt 

Ig 


( 0 ) = sup 

P^Q 


P,[0(P),0(Q)] 

Hg[P,Q] 

P,[0(P),0(P + eH)] 


> sup sup _ 

p AeT:,v Hg{P,P + eA) 


However 


Pg[0(P),0(P + eH)] _ (0(A), [0(H)])+ 0(e 


Hg{P,P + eA) 


{A,n%[A])+0{t 


Since the quantity on the right can be made arbitrary close to “(0), we con¬ 
clude that ?7^®^^’^*(0) > ?7^'®™(0). Finally, to prove the third inequality we hrst 
choose Soit) to be a minimizing path for Dg{P, Q), i.e. 


10 


DAP.Q)= / 


Then, 0 o 5*0 is a smooth path from 0(P) to 0(Q). Moreover, the linearity of 0 
A 

dt 


implies that ^0 o So{t) = 0 o So{t). Thus 

ri 


^<?[0(Q),0(Q)] < / d{<P°So{t),QloSoit)^°^o{t))dt 




11/2 


{s,{t),ni^^^So{t))dt 


= [vf^^{4>)]P^Dg{P,Q). 
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Dividing both sides by Dg{P, Q) and taking the snpremnm of the left hand side, 
gives the desired resnlt. QED 

In this case of the hrst inequality (0) > 77 ^^®™ (0), we proved slightly 

more, namely, that either equality holds or the snpremnm in 77^®^^'^*(0) is actu¬ 
ally attained for some non-negative (but not necessarily strictly positive) density 
matrices P,Q, i.e., strict inequality implies that there exists P ^ Q E V such 
that 


HMnm)] = ( 68 ) 


This follows from the fact that we can always hnd a maximizing sequence (P^, Qk) 
such that 


P,[0(Pfe),0(Q)fe] 

fc^OO Hg{Pk,Qk) 


RelEnt 

I9 


(</>)• 


Since we are in a hnite dimensional space, the space of non-negative density 
matrices is compact so that we can hnd a convergent subsequence {Pkj,Qkj) —^ 
(P, Q). Then either P = Q in which case we necessarily have 77 ^®^^“^*(0) = 77 ^'®™( 0 ) 
or (|68|) holds. (Strictly speaking, we must also exclude the possibility that both 
Hg{Pk, Qk) and Pg[ 0 (Pfc), 0(Qfc)] diverge to cx).) We expect that for most choices 
of g equation (|68D holds only in very special cases [see, e.g., the partial trace 
example in Section |4.4| which yield ? 7 ^®*^°*( 0 ) = 1 = 77^'®™(0).] Indeed, even 
for commutative systems, early proofs 0 , 0 that equality holds for rjiagi^) = 
r]{w_i) 2 [A) depended on a demonstration that (|68D could not hold in general. 

Another special situation occurs for the minimal g which yields the Bures 
metric. If P, Q commute, then 


HBuresiP,Q) = Tr(P - Q)([Lp + Pq]+ [Lq + Pp]-')(P - Q) 

= 2Tr(P-g)(P + Q)-i(P-g) 

= 2((p-g),D^’tQ[(p-g)]). 

Thus if the snpremnm for 77g(fj,™(0) happens to be attained for a commuting pair 
P, A (with P G P and A G TpV) whose images 0(P), 0(A) also commute, then 

PBures[0(P + A), 0(P - A)] = 77 ^^” (0)PBures(P + A, P - A). (69) 

If hBures°*(*?^) = '^Burffi (*?^)5 then this also yields equality in (H); however, it does 
not give strict inequality for ?7Bure^*(0) — hBur^(0)- contrary, it seems to 

offer some heuristic support for equality. 

We expect that in those exceptional situation in which the snpremnm 
is attained the result is equal to 77 ^'®™( 0 ) so that equality always holds, at least 
for the hrst inequality in Theorem [4.2|. 
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Recall that many common choices for g [e.g., g{w) = {w — 1)^ or g{w) = 
— logtn] do not yield a symmetric relative entropy, i.e., Hg{P,Q) ^ Hg{Q,P). 
This raises the question of whether or not the entropy contraction coefficient 
[which we denote symmetrized relative entropy 

i/f “(P, Q) = HgiP, Q) + Hg{Q, P) = P,H+^,(^-i)(P, Q) (70) 

is the same as {(p). Although we believe equality holds, we can only prove 

that 




(71) 


Nevertheless, Theorem holds for any g. In fact, since there is a unique Rie- 
mannian metric associated with all g which yield the same symmetrized relative 
entropy, we have (0) > To prove ([7l|) it suffices to 

observe that 


Q) = “(^1 Q) = Hg{P, Q) + Hg{Q, P) 

so that 


= HMP)Am + H,WQ)A(p)] 

< ’lf‘"^'"{<l>)H,(P, Q) + P) 

In the case of the quadratic entropy, it easily follows that ((p) = (0) = 

Finally, we note that the joint convexity of relative entropy, Riemannian met¬ 
rics, and [Dg{P,Q)]‘^ imply that the corresponding contraction coefficients are 
convex in (p. (Although we did not explicitly state the joint convexity for Mp{A, A) 
it is an easy consequence of homogeniety and contraction under partial traces.) 


Theorem 4.3 For each fixed g G Q, each of the contraction coefficients {(p), 

^Riem( 0 )^ and g^^°A<t>) convex in cp. 

Proof: Since the argument is straightforward, we give details only for the relative 
entropy. Let (p = x<pi -|- (1 — a:) 02 - 

^<7[0(^)XQ)] = Hg[xcpi{P) + {1 - x)(p 2 {P),x(pi{Q) + {1 - x)(p 2 {Q)] 

< xHg[MP),MQ)] + {i-x)Hg[(P2{P),HQ)] 


RelEnt 
9 

RelEnt 
X 'In 


< X Q) + (1 -1^) Ilf WHAP, Q) 

(.).,)+ (l-l)l)“(^2 )li/„(P,Q). 


Dividing both sides by Hg{P, Q) implies 


^RelEnt + (1 


^)^RelEnt(^^)_ 
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4.2 Eigenvalue formulation of 77 ^^®“(0) 

We now show how is related to the following set of eigenvalue problems: 

[$oni^^^ocj,]{A) = xnUA). (72) 


In view of Theorem p.lOl , this is a well-defined linear eigenvalue problem on 
for each fixed pair 0 and P. The following remarks are easily verified. 

a) The eigenvalue problem (|7^) can be rewritten as $p o 0(i?) = \B where 

= (73) 

Furthermore, <Fp is trace-preserving. This follows from Theorem 2.12 and 

= {/, S) = TrS 


b) We can assume without loss of generality that matrices which are eigenvec¬ 

tors in (^) are self-adjoint, i.e., that A = A*. Indeed, it is easy to check 
that the operator r2p°(4) = {sRp + Lp)[Rp + Lp]~^{Rp + sLp){A) satisfies 
[f2p“(4)]* = r2p°(4*). Therefore, the operators hip, 0, 0 and d>p all 

map adjoints to adjoints. 

c) For each hxed P, the eigenvalue equation is satisfied with A = P and 
eigenvalue A = 1 which is the largest eigenvalue. The operators on both sides 
of (l7^) are self-adjoint (in fact, positive definite) with respect to the Hilbert- 
Schmidt inner product and the corresponding orthogonality condition for 
the other eigenvectors reduces to Pi A = 0. 


In view of these observations, it is easy to conclude from the max-min principle 
that the second-largest eigenvalue A 2 ( 0 , P) satisihes 


’ - /A 09 

A&TpV {A Alp 

for each fixed P. Then taking the supremum over P yields 

Theorem 4.4 For each g E Q and stochastic map 0 

<”“(•#') = supA|(,(..B). 

P&V 




(74) 


(75) 
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We have already observed that every QPp can be regarded as a non-commutative 
variant of multiplication by P~^. Indeed, if both pairs of operators P, A and 
(j){P), (j){A) associated with a particular eigenvalue commute for some g, then 
QPp{A) = Rp-i{A) = Lp-i{A) for all g and the corresponding eigenvalue equa¬ 
tions are the same. It may be tempting to conjecture that the eigenvalue equations 
for different g are related by a similarity transform, which would then imply that 
all A2(0, P) are equal so that all are identical. However, for a given hxed 

P, Rp and Lp commute, which implies that ffp and flp commute for any pair of 
functions g and h. Since commuting operators are simultaneously diagonalizable 
and similar operators have the same eigenvalues, this would imply that all of 
the eigenvalue operators B —>■ o 0 o o </> {B) are identical. This is 

easily seen to be false in specihc examples. Moreover, as discussed at the end of 
Section [4.4| one can hnd examples of non-unital (j) for which different are 

not identical. 


Theorem 4.5 We can rewrite the eigenvalue problem so that 
A,(0,P)-snp • 

where the supremum is now taken over |a G Range(0) : Tr[f2®^p^]“^(a) = o|. 


Proof: 


^ 2 ( 0 ,^) = sup 


{<P{A),Ql^p^[<P{A)]) 


A:TriA)=0 {A,nUA]) 

O 0 O 0 ( 1)0 [H|,]“^P]P) 


sup 

B:Tr[Of,]-l/2(B)=o 


{B,B) 


If we now write T = o 0 o we see that P) is the largest 

eigenvalue of r*r where T maps 

{b : Tr[ff?,]-V2(5) = 0 } ^ {/3 e Range(0) : Tr[f]^(p)]-i/2(0) = o} . 

Since FT* and r*r have the same non-zero eigenvalues. 


A!(0,P) 


(0[^0(P)] / O 0 O [^p] O 0 O [^^(P)] ^ ]0) 

- 7^\ - 

/3:Tr[n3(^j]-l/2(/3)=0 {P,P) 

(0(«)[^p]~^0(«)) 
a:Tr[0®^pj]-l(o)=0 [^(/,(P)] 
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If we apply this result with = [Rp + Lp] it follows easily from the 

theorem above that 


Ap“(^,P) 


Tr0(a)P0(a) 

sup ^ - 

a:Tr 0 (P)o=o Tra0(P)a 


(76) 


It is tempting to write (j){P)a = 13 = 4>{B) and replace the constraint Tr0(P)Q! = 0 
by TrP = 0. The denominator would then become which 

has the same form as the numerator in (| 6 ^ when k{w) = (corresponding to 
g = {w — 1)^). However, we there is no guarantee that ^(a) = 0([0(P)]“^]P). 
On the contrary, this cannot possibly hold because we would then have that 
the A (and hence t]) for the two extremal functions k{w) = and k{w) = 
are inverses, which is inconsistent with A®(0, P) < 77 ^^®™( 0 ) < 1 (except 
in the case A = 1 which is not generic). There is, however, a sense in which 
the operators associated with these two extremal functions are inverses since 
+ Lp^ = p-i[Pp + Lp\L-p^ = R-p^[Q$^'^^^]-^L-p\ It seems likely 
that if the for these two extremal functions are equal, then all of them are. 

Unlike the case of (0), we do expect that the supremum for 

is actually attained. Indeed, we know that for each hxed P the supremum in 
(ffil) is attained for some A ^ 0 which satishes the eigenvalue problem (^). As 
before, we can hnd a maximizing sequence of density matrices P^ for o so that 
r7^iem(0) = limfc^oo AK^, Pk)- For each P^, let A^. be the solution to the eigenvalue 
problem ([ 7 ^) for A 2 ( 0 , Pfc) normalized so that Tr|Afc| = 1. Then we can hnd a 
convergent subsequence for which P^ —a P G P and —a A 7 ^ 0 since Tr|A| = 1. 

It then follows that (j 7 ^) holds for this P, A with A = 77^^®™ (0) (although P is only 
non-negative) which implies 


4.3 Bounds on contraction coefficients 

We hrst give an upper bound for using 

AST*© ll’|A| 

This can be interpreted as the norm of 0 regarded as an operator on the Banach 
space of traceless matrices with norm Tr|A|. Although the function g{'w) = |ta —1| 
is not operator convex, T^Dobrushmj'^^ jg analogous to the contraction coefficient of 
the (non-differentiable) symmetric relative g-entropy P|«,-i|(P, Q) = Tr|P — Q\ 
which, however, is not the relative g-entropy obtained by using g{w) = |tc — 1| 
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in Definition |2.1| . Nevertheless, j^Dobmshm^^^ jg ^ natural and useful object to 
consider. It was shown in (see Theorem 2) that 


^Dobrushin 


V 


(0) = - sup{Tr|0(i? — F)\ : E,F 1-dim projs; EF = 0} 


(78) 


where “1-dim projs” means that E, F are one-dimensional projections in T>. 
The expression on the right in (|78[) shows that we are justified in interpreting 
^Dobrushm^ 0 ^ as a uou-commutative analogue of Dobrushin’s coefficient of ergod- 
icity. 


Theorem 4.6 If (f) is stochastic, 


^Riem / 


hio7"(0) < V 


Dobrushin / _ 


( 0 ) = sup 


Tr|</.(21)| 


A^nv Tt\A\ 


(79) 


Proof: The map B (Dp®)"^ o 0 o = ^iog(-B) is positivity-preserving, 

as well as trace-preserving. The former follows from the integral representations 
(^^ and (^ID for Dp® and its inverse together with the fact that the composition 
of positivity-preserving maps is positive-preserving. Then taking the trace of the 
absolute value of both sides of the eigenvalue problem ^[^(A)] = \A and using 
Theorem 1 of yields 


ATr|7l| = Tr|<l'[0(7l)]| < Tr|0(7l)|. QED 


(80) 


Although we believe that this result holds for any we do not have a proof 
except for the log. Our proof depended on the observation that in the case of the 
log the map ^g{B) = (Dp)“^ o 0 o D^^pj(i?) is positivity-preserving. However, 
explicit examples can be found to show that is not positivity preserving in 
general. Indeed, although both Dp"''®® = [Rp + Lp]~^ and D^~^^ = Rff + Lff are 
positive semi-dehnite with respect to the Hilbert-Schmidt inner product, they are 
not positivity preserving in the sense of mapping positive operators to positive 
operators. The difference is analogous to the difference between an ordinary 
matrix being positive semi-dehnite and having positive elements. 


We now consider lower bounds on r 7 ^'®'"( 0 ). In it was shown that 


^Dobrushin / 


T] 


(0) < 


(81) 


We now give a lower bound which holds for all when the map 0 is unital, 

i.e., 0(1) = I. 

Theorem 4.7 7/0 is unital, 




Tr|0(A)p 


T.A=o Tr|A|^ 


( 82 ) 
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This is an immediate consequence of the dehnition (|65D; it also follows from 
Theorem and the fact that the right side of (|8^ is just A2(0,/) when cj) is 
unitah The right side of (|8^ ) can also be interpreted as the square of the norm 
of 0 regarded as an operator on the Banach space of traceless matrices with 
Hilbert-Schmidt norm vTrT04. When 0 is self-adjoint in the sense 0 = 0, every 
trace-preserving map is unitah 

If 0 maps to itself, then the results of this section can be restated in 

terms of the eigenvalues and singular values of 0. Since 0 is trace-preserving, 
4>{B) = AB implies that either A = 1 or Tri? = 0. If we restrict 0 to the matrices 
with trace zero, then j^Dobmshm^^^ jg largest magnitude of an eigenvalue and 
for unital 0 A2(0,/) is the largest eigenvalue of 00. Thus for unital stochastic 
maps, A 2 ( 0 ,/) = A 2 ( 00 ) where we have continued our convention of using the 
subscript 2 for eigenvalues of maps restricted to T^V. If 0 is self-adjoint, the 
two lower bounds (^) and (^2]) coincide and A 2 ( 0 ,/) = A 2 ( 00 ) = [A 2 ( 0 )]^ in the 
usual sense of second largest eigenvalue of. For general unital 0, (|82|) is stronger 
since 


hpi-“)2(0) > A2(0,/) = As (00) > 


^Dobrushirn-^^ 


(83) 


We now explicitly state some conjectures which have already been discussed. 


Conjecture 4.8 For each fixed g ^ G, 


Conjecture 4.9 If is unital, then 


V 


Riem 

9 


As (00) 


Tr|0(A)P 

rp I 412 

TrA=0 Tr|Ap 


(85) 


for all g ^ Q. 

If this conjecture holds, then for unital 0 the contraction coefficient is 

independent of g. Theorem ( [4.13| ) at the end of the next section contains an 
explicit example of a non-unital stochastic map for which depends non- 

trivially on g-, therefore, the hypothesis that 0 be unital is essential. In view of 
(^ it would suffice to show that < As (00) 








4.4 Examples 

We now consider some special classes of stochastic maps (f) ■. Ai ^ A 2 - We begin 
by looking at some maps for which all contraction coefficients are easily seen to 
be zero or one. We then consider maps from to which provide support 
for the conjecutres above. 

We hrst consider the case in which A 2 is one-dimensional, e.g., 0 projects onto 
a one-dimensional subalgebra (which need not have an identity) of Ai. Then, since 
0 is trace-preserving and maps density matrices to density matrices, we must have 
0(P) = 0(Q) y P,Q with Tr0(P) = 1 so that 0(P) 7 ^ 0. Thus, Pg[0(P), 0(Q)] = 
Dg[(P{P),(P{Q)] = 0 V P,Q which implies = ??f°'^(0) = 0. If TrP = 0, 

then 0(P) = 0. (To see this note that one can hnd a, b such that P = (aJ -|- bB) 
is a density matrix.) Thus (0(P)f2^(p)0(P)) = 0 and Tr|0(P)| = 0 for all B in 
which implies ? 7 ^^®™( 0 ) = j^Dobrushm^^^ _ g_ summarize this as 

Theorem 4.10 If the image of the stochastic map 0 is one-dimensional, then 

^RelEnt('0) ^ = ??f°'^( 0 ) = ^ q 

We next consider the important special case in which 0 is a partial trace r. 
In the simplest case, let r : be the map which takes 

^ ^ ^ ^t{M) = A + D (86) 



where M G has been written in block form and A,B,C,D G 

Then the homogeneity of relative entropy (see Dehnition ^.3|b) implies that for 


P 0 
0 P 


and Q = 


Q 0 


0 Q J 

Hg{P, Q) = Hg{2P, 2Q) = HgiriP), r(Q)) 
for any g, and similarly 

{A,nf,(A)) = {2A,n^p(2A)) = {T(A),n»|p,(T(A))) 
' A 0 ■ 


when A = 


0 A 


From this, we easily see that 


= 1, (87) 

where we have assumed implicitly that r acts on the full algebra of all 2n x 2n 
matrices. 

The partial trace described above is similar to a conditional expectation, i.e., 
a map for which A 2 is a subalgebra (with identity) of Ai and 0(A) = A V A G A, 2 . 
Both partial traces and conditional expectations are included in the following 
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Theorem 4.11 If the stochastic map (f is also an isomorphism from a non-trivial 
subalgebra (with identity) of Ai to A 2 , then ^ ^ ^ 

^Dobrushinj-^j = I for all Q & G. 


Since every completely positive map can be represented as a partial trace |]K 
this might seem to suggest that rj = 1 always holds. However, these representa¬ 
tions involve multiple copies of the algebra, so that the partial trace is not acting 
on the full algebra in the higher dimensional space. Thus, the representation of 
Ai need necessarily not contain a subalgebra with the desired isomorphism prop¬ 
erty. Examples of maps with p < 1 were already found in for commutative 
algebras, and two different non-commutative examples are given below. 

We now state two results for maps (f : The proofs are postponed 

to a subsequent paper ||^. Recall that any density matrix in can be written 


in the form ^[I + w-cx] where w G and a denote the vector of Pauli matrices. 
The first theorem provides evidence for the two conjectures at the end of the 
previous section. 


Theorem 4.12 For the unital map ^ -f + w-a —> / Tw-cx, 

^RelEnt^^T) = = UTI^ V ^ G 

and r/D°brushm(' 0 ^) ^ ||rj||^ 

The next example gives a non-unital stochastic map for which 77 ^'®“ (0) varies 
with g. For a, r > 0 with a -f- r < 1, define 

Goi,t[I + w-cr] = / -1- awiai + r(X 2 . (88) 


It is easily seen to be stochastic because the condition a -|- r < 1 insures that it 
is a convex combination of stochastic maps. For gso{w) = {w — 1 )^/{w + sq) as in 
Example 3 of section ^ 

^ 7 ( 0 ) = sup 


[(l-r2 + (p-a>2] 


> 


0<tJ<l [1 - - a2^;2j _ ^2(^1 _ p-^ _ (^i _ p)q,2^;2] 


1 - (is) 


where 1 — p = und equality holds for sq ~ 0. In particular, we can conlude 


Theorem 4.13 For the non-unital stochastic map cj) given by there is an 
S > 0 such that for sq G [0, S), 


p 


Riem 

so 


{(>) = 
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Furthermore 




a 


1 — F 


< a = 7] 


Dobrushin 


( 0 ). 


If Si G (0, S'), we have = iZ^- 
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